The Qur'an Lexicon Project: A database of lexical statistics and phonotactic probabilities for 19, 286 contextually and phonetically transcribed types in Qur'anic Arabic
نویسندگان
چکیده
Reciting and memorizing the Qur’an forms a major part of religious practice for 1.6 billion Muslims around the world; in non-Arabic-speaking Muslim communities, it also provides Muslim speakers of other languages with their first exposure to the Arabic script and language. However, little research has been completed regarding the psycholinguistic processing of Qur’anic Arabic. In this paper, we present the first psycholinguistic database for Qur’anic Arabic, which comprises lexical variables (length: character, syllable, phone; frequency: item, syllable, biphone, phone; lexical uniqueness point, orthographic and phonological neighbourhood sizes, and orthographic and phonological Levenshtein distances) as well as phonotactic probabilities (positional segment and biphone) for 19,286 types that we contextually and phonetically transcribed based on Qur’anic recitation. This open-source resource will be useful for researchers studying Qur’anic Arabic lexical and phonological processing as well as for making systematic cross-linguistic comparisons that allow better delineation of language-specific and languagegeneral processes in language processing.
منابع مشابه
Tools for Arabic Natural Language Processing: a case study in qalqalah prosody
In this paper, we focus on the prosodic effect of qalqalah or "vibration" applied to a subset of Arabic consonants under certain constraints during correct Qur'anic recitation or taǧwīd, using our Boundary-Annotated Qur’an dataset of 77430 words (Brierley et al 2012; Sawalha et al 2014). These qalqalah events are rule-governed and are signified orthographically in the Arabic script. Hence they ...
متن کاملAn Exegetic Study of the Necessity of Criticism in Social Scenes as Found in the Holy Quran with an Emphasis on the Verse Mujadeleh
In the Islamic society, as being of the idea of progress, there is no proper place for criticism. One of the most important causes of this deficiency is the neglect of the necessity of criticism in the Qur'an. In this study, a comprehensive definition of criticism was presented, using lexical and interpretive sources. Then, since the word criticism was not used in the Qur'anic verses, the conce...
متن کاملThe sensitivity of children with SLI to phonotactic probabilities during lexical access.
UNLABELLED The procedural deficit hypothesis (Ullman & Pierpont, 2005) has been proposed to account for the combination of linguistic and nonlinguistic deficits observed in specific language impairment (SLI). According to this proposal, SLI results from a deficit in procedural memory that prevents children from developing sensitivity to probabilistic sequences, amongst other deficits. We tested...
متن کاملThe Effect of Lexicon-based Debates on the Felicity of Lexical Equivalents in Translating Literary Texts by Iranian EFL Learners
This study was an attempt to investigate the effect of lexicon-based debates on the felicity of lexical equivalents in translating literary texts by Iranian EFL learners. To fulfill the purpose of this study, 59 university students, majoring in English Translation, were randomly assigned to the experimental and control groups from a total of 73 students based on their performance on a mock TOE...
متن کاملDemonstration of Multi Statutory of the Adjective “Just” in Modern Adjectival English Lexicon
This article concerns the general functional features of an adjective in modern English, and in particular multistate lexical item “just”, which carries different meanings in different variants of combinatorics. The authors analyze the combinations used with the adjectival lexeme item “just” and reveal the categories that determine the degree of semantic content of each given statement. The nee...
متن کامل